Algorithm-irrelevant Privacy Protection Method Based on Randomization
نویسنده
چکیده
Privacy preserving classification mining is one of the fast-growing subareas of data mining. The algorithm-related methods of privacy-preserving are designed for particular classification algorithm and couldn’t be used in other classification algorithms. To solve this problem, it proposes a new algorithm-irrelevant privacy protection method based on randomization. This method generates and opens a new data set that is different from the original data set independently as the perturbed data. The perturbed data and the original data have the same distribution. Users get the models of the original data from the perturbed data. Experimental results demonstrate that the classification algorithms can be used on the perturbed data directly. And this method reduces the privacy data disclosure risk more effectively.
منابع مشابه
A New Algorithm-independent Method for Privacy-Preserving Classifica- tion Based on Sample Generation
With the development of data mining technologies, privacy protection is becoming a challenge for data mining applications in many fields. To solve this problem, many PPDM (privacy-preserving data mining) methods have been proposed. One important type of PPDM method is based on data perturbation. Only part of the data-perturbation-based methods is algorithm-irrelevant, which are favorable becaus...
متن کاملDifferentially Private Local Electricity Markets
Privacy-preserving electricity markets have a key role in steering customers towards participation in local electricity markets by guarantying to protect their sensitive information. Moreover, these markets make it possible to statically release and share the market outputs for social good. This paper aims to design a market for local energy communities by implementing Differential Privacy (DP)...
متن کاملA novel anonymization algorithm: Privacy protection and knowledge preservation
In data mining and knowledge discovery, there are two conflicting goals: privacy protection and knowledge preservation. On the one hand, we anonymize data to protect privacy; on the other hand, we allow miners to discover useful knowledge from anonymized data. In this paper, we present an anonymization method which provides both privacy protection and knowledge preservation. Unlike most anonymi...
متن کاملA Data and Knowledge Driven Randomization Technique for Privacy-Preserving Data Enrichment in Hospital Readmission Prediction
In health care predictive analytics, limited data is often a major obstacle for developing highly accurate predictive models. The lack of data is related to various factors: minimal data available as in rare diseases, the cost of data collection, and privacy regulation related to patient data. In order to enable data enrichment within and between hospitals, while preserving privacy, we propose ...
متن کاملPrivacy Preserving Data Mining in Health Care Applications
In recent years data analysis and protection are the two important parameters in any organization to improve the efficiency and privacy. In hospital management, privacy plays a crucial role. In order to hide the patient’s related databases effective data mining algorithms are to be implemented. In this project we are going to create a database and perform different algorithms on it, so as to ma...
متن کامل